PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Sobic.002G029900.1.p
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; Liliopsida; Petrosaviidae; commelinids; Poales; Poaceae; PACMAD clade; Panicoideae; Andropogonodae; Andropogoneae; Sorghinae; Sorghum
Family MYB
Protein Properties Length: 1035aa    MW: 114299 Da    PI: 9.7213
Description MYB family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Sobic.002G029900.1.pgenomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Myb_DNA-binding26.81.2e-08412455346
                           SS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHH CS
       Myb_DNA-binding   3 rWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqk 46 
                           +WT++E+ +l+  +++ G+ +W  Ia t+g++Rt+ qc  r+q 
  Sobic.002G029900.1.p 412 AWTADEETKLLLIIQEKGMCNWINIAVTLGTHRTPFQCLVRYQR 455
                           8**************************************99995 PP

2Myb_DNA-binding42.91.1e-13464508248
                           SSS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHHHT CS
       Myb_DNA-binding   2 grWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqkyl 48 
                           ++WT+eEd++l  av+ +G + W ++++ +  gRt+ qc +rw+k l
  Sobic.002G029900.1.p 464 KAWTKEEDLQLQAAVETFGQK-WQLVSASLD-GRTGTQCSNRWRKTL 508
                           58*******************.*********.************975 PP

3Myb_DNA-binding30.11.2e-09516560247
                           SSS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHHH CS
       Myb_DNA-binding   2 grWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqky 47 
                           grW  +Ed++l   vk+ G+g+W++Ia  ++ gRt  q+ +rw + 
  Sobic.002G029900.1.p 516 GRWLLDEDKRLMVTVKLIGPGRWSLIAPFIP-GRTQTQIFERWCNI 560
                           89*****************************.***********996 PP

4Myb_DNA-binding31.63.9e-10569612347
                           SS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHHH CS
       Myb_DNA-binding   3 rWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqky 47 
                            W +eEd  l+  v ++G+  W++Ia+t+  gR + +c  rw+k+
  Sobic.002G029900.1.p 569 DWRPEEDSMLLASVSEFGPC-WSKIAKTIIPGRNDSMCYRRWRKL 612
                           5*****************88.********99************97 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PROSITE profilePS500906.028310402IPR017877Myb-like domain
SMARTSM007175.5E-5310406IPR001005SANT/Myb domain
Gene3DG3DSA:1.10.10.606.8E-12312329IPR009057Homeodomain-like
CDDcd001670.00141369402No hitNo description
Gene3DG3DSA:1.10.10.606.8E-12376417IPR009057Homeodomain-like
SuperFamilySSF466891.48E-5379422IPR009057Homeodomain-like
PROSITE profilePS500908.2405457IPR017877Myb-like domain
SMARTSM007177.6E-6409459IPR001005SANT/Myb domain
SuperFamilySSF466891.98E-21409504IPR009057Homeodomain-like
PfamPF002495.0E-6411455IPR001005SANT/Myb domain
Gene3DG3DSA:1.10.10.602.7E-11418463IPR009057Homeodomain-like
PROSITE profilePS5129418.973458512IPR017930Myb domain
SMARTSM007171.6E-11462510IPR001005SANT/Myb domain
Gene3DG3DSA:1.10.10.601.2E-15464510IPR009057Homeodomain-like
PfamPF002492.2E-12464507IPR001005SANT/Myb domain
CDDcd001679.97E-10465508No hitNo description
SMARTSM007172.5E-7514563IPR001005SANT/Myb domain
SuperFamilySSF466891.76E-21515609IPR009057Homeodomain-like
Gene3DG3DSA:1.10.10.601.8E-13515561IPR009057Homeodomain-like
PROSITE profilePS5129415.069516565IPR017930Myb domain
PfamPF002491.1E-8516561IPR001005SANT/Myb domain
CDDcd001676.65E-7517561No hitNo description
Gene3DG3DSA:1.10.10.606.5E-14562616IPR009057Homeodomain-like
SMARTSM007172.6E-7566615IPR001005SANT/Myb domain
PROSITE profilePS5129410.532568617IPR017930Myb domain
PfamPF002492.6E-7569612IPR001005SANT/Myb domain
CDDcd001672.29E-8570612No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0003677Molecular FunctionDNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 1035 aa     Download sequence    Send to blast
MDWYSDDSDP DIDEDLREDL DALRRSCILS GADPDAAVAQ VSSGLLAGPS TPALASAAPG  60
AAANHHHASS SDDDEEEDED LALVRTIRAN LHHLHNNKAS PAAPRADDGD PSSSPRPICT  120
WPPSDTDEEE DDLETLRAIQ RRFSHYQSST STASPKTMKP EASQGVHSDL FADRPDDDLA  180
VQKQNANAPH RDGFPKAALL LVDALKKNRA CQKFIRRKMV NIEAKIEENK DLRDRVKCLL  240
GYQLSCRKSV GRSLGQKEDP RVRLISPLKS TQPCSKNKYR KMPALFLGPA ENPHVSKYEM  300
VLKQFPLSFK KQPWSDAEKD KLARGIKQQY QETLILDSLN NGSADGDFSA VDMAYALTTG  360
AGNFEVTPEN LRSVLPLINW DKISAMYLPG RSGAECESRW LNCDDPLINH EAWTADEETK  420
LLLIIQEKGM CNWINIAVTL GTHRTPFQCL VRYQRSLNPH ILNKAWTKEE DLQLQAAVET  480
FGQKWQLVSA SLDGRTGTQC SNRWRKTLAP ERTSVGRWLL DEDKRLMVTV KLIGPGRWSL  540
IAPFIPGRTQ TQIFERWCNI LDPDLYLDDW RPEEDSMLLA SVSEFGPCWS KIAKTIIPGR  600
NDSMCYRRWR KLCKHEVQKV REARQLKKAI FQTNFVDREK ERPAICPRDL ISLLPSKGDG  660
CGEATVSGRS KKQGEENLAV SNIANISAGL DCVSANTDLS TGSRRSRRVS SGRRSKQHTE  720
GNNIAVPDDL NASSSAPSSS RKRKSTTGNS VAAKKRLRVS ISVSADNEVE TNKITNSVAV  780
GEEGVVKKRR RRSKLVCNEG ADNEVGANKI MDSVAVGEEG AVKKRRRRSK LVCNEGADNE  840
VGANKIMDSV AVGEEGAVKK RTRRSKPVGN EGAVRKRRGS INRDDEAGTN IRMDPAIGKE  900
GVVKKRTRRS KPVGTEGPAS IGDNGVVKKR TGSVSTENHG AVTKRNRASS RRRKSANLPT  960
EGVPNAATDL NLPCAISEAR VVDAGSMDKG RRKSTPRPKQ LIMSEGDADK NSTFTRLANC  1020
LSFARMKGIN RNKR*
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
1h88_C3e-263826593155MYB PROTO-ONCOGENE PROTEIN
1h89_C3e-263826593155MYB PROTO-ONCOGENE PROTEIN
Search in ModeBase
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1739744SRKRKS
2822827KKRRRR
3858877KKRTRRSKPVGNEGAVRKRR
4949954SRRRKS
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_008652002.10.0PREDICTED: uncharacterized protein LOC103631936
RefseqXP_008652003.10.0PREDICTED: uncharacterized protein LOC103631936
TrEMBLA0A096QN110.0A0A096QN11_MAIZE; Uncharacterized protein
STRINGGRMZM2G051793_P010.0(Zea mays)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MonocotsOGMP65943543
Representative plantOGRP73551617
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G18100.21e-135myb domain protein 4r1